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Abstract 

In this paper, we consider networked estimation of linear, discrete-time dynamical systems monitored 
by a network of agents. In order to minimize the power requirement at the (possibly, battery-operated) 
agents, we require that the agents can exchange information with their neighbors only once per dynamical 
system time -step; in contrast to consensus-based estimation where the agents exchange information until 
they reach a consensus. It can be verified that with this restriction on information exchange, measurement 
fusion alone results in an unbounded estimation error at every such agent that does not have an observable 
set of measurements in its neighborhood. To over come this challenge, state-estimate fusion has been 
proposed to recover the system observability. However, we show that adding state-estimate fusion may 
not recover observability when the system matrix is structured-rank (S'-rank) deficient. 

In this context, we characterize the state-estimate fusion and measurement fusion under both full 
5-rank and 5-rank deficient system matrices. The main results of this paper are the following. Firstly, 
we show that when the system matrix has full Shrank, state-estimate fusion alone (with no measurement 
fusion) can recover the observability. Subsequently, we characterize the minimal topology for inter-agent 
communication required for a stable networked estimator. Secondly, we provide methodologies to recover 
(networked) estimator observability when the system matrix is S'-rank deficient. In particular, we introduce 
a novel agent classification based on their local measurements and identify the agents that are crucial 
for stable estimation error. We then provide topology modifications and sensor placement techniques to 
recover observability in the 5-rank deficient scenario. Finally, we provide an iterative method to compute 
the local estimator gain at each agent that results into a stable estimation error once the observability is 
ensured using the aforementioned techniques. 

Keywords: Networked estimation, Observability, Structured system theory, Generic rank 

I. Introduction 

Estimation of dynamical systems with observations distributed among a network of agents is an 
important field of research, where the idea is to assign a group of agents to monitor a certain system 
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Fig. 1. A group of mobile ground-robots are tracking a large flock of quad-copters. Every ground-robot is monitoring part of 
the flock, and then shares its partial data with other ground-robots through a communication network. 



or phenomenon of interest. Agents are distributed in the sense that each agent can only measure some 
of the states of a dynamical system, referred to as local measurements. For example, a group of sensors 
spread geographically over a large region to monitor daily temperature evolution. The measurement data 
and dynamical models are further corrupted by noise and disturbances. The objective is to enable each 
agent to make an unbiased decision on the global state relying only on its own measurement and the 
measurements from its immediate neighbors. Such a scheme is often referred to as networked estimation 
where the term network implies that the information is restricted on a sparse network. 

Networked estimation is preferable to a wide range of applications as it is scalable and further requires 
less communication load at each individual agent, in contrast to the centralized case where each agent may 
require repeated long-distance communication to a central location. Applications of networked estimation 
include social networks (T) to learn global beliefs based on partial understanding of the state of the society, 
market, politics, etc., monitoring physical processes and environmental spatio-temporal fields (2), |3j, state 
estimation in power systems (4)-|[6), and multi-agent systems such as collaborative target tracking and 
flocking of mobile robots [7|, see for example Fig. [T] 

A variety of solutions exists for networked estimation starting from the earlier work in |8), (9) and 
references therein on parallel Kalman filter architectures for all-to-all connected networks, to more recent 
diffusion-based schemes via least mean square implementation, such as in Kalman filtering and smoothing 



1 10 1 and distributed binary detection [11 1. Meanwhile, incremental adaptive distributed strategies can 
be found in (T2), |T3| along with distributed moving horizon estimation (T4) to minimize estimation 
error variance for constrained problems. State estimators based on low-cost single-bit data transmission 



is proposed in (T3J with binary sign of innovations (sign of difference of measurement and estimated 
value). Information theoretic approach based on consensus over the Kullback-Leibler average of Gaussian 
PDFs is exploited in p6|. The literature can also be classified into static and dynamic estimation. In static 
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Fig. 2. (a) The traditional two-time scale consensus-based approach, (b) single time-scale approach. 



estimation (2), pO|-fT2|, |17|, (18J, the target state to be estimated does not change over time, while 



dynamic estimation Q, (8), fl4|-[16|, 1 19|-|23| takes into account the time-evolution of the systenf] 

Consensus-based strategies have recently found a lot of interest in the context of sparsely-connected 
networks, where the main focus is to reduce the uncertainty of individual estimates by averaging on 
collaborative data. Early work in (T9j-|23) considers a two time-scale method where consensus is 
implemented at a time-scale different than the system dynamics. These results require that a consensus is 
reached within every two time-steps of the system dynamics, and is thus, challenged with a large number 
(infinite, in general) of consensus iterations between every two steps of the dynamics. To elaborate this, 
consider Fig. |2ja), where a large number (—> oo) of data fusion iterations are implemented between every 
two successive time-steps, k and k + 1, of the dynamics. This approach requires communication over a 
much faster rate than the sampling of the dynamics, and thus, in general, becomes practically infeasible 
when the underlying system is operating under power constraints and has restricted communication and 
computation budgets. 

In contrast to the two time-scale approach to distributed estimation, recently References (T), (l6)-|[T8j, 



|23|-[25| studied the behavior of networked estimators when the communication time- scale is the same 



! As stated in [17], diffusion algorithms can be extended for non- stationary (dynamic) tracking when the target is not moving 
too fast, i.e. its state is relatively stationary over a period such that the algorithm can converge. 



August 21, 2012 



DRAFT 



4 



as the time-scale of the dynamics, as shown in Fig. [2jb). This method is practically feasible for real-time 
applications and computationally efficient as compared to the two time-scale approach. A preliminary 
study on this single time-scale estimator is carried out in (24) , where it is shown that a particular linear 
networked estimator has a bounded estimation error if the two-norm of the system matrix is less than the 
network tracking capacity-a function of the communication network and observation models. Notice that 
in the two time-scale method, the communication network becomes irrelevant due to more information 
exchanges among the individuals (the information in a sparsely connected graph is equivalent to the 
information in a fully connected graph when a large number of information exchanges are carried out). 
Therefore, the performance and properties of the underlying estimator depends only on the data fusion 
principles among the agents. However, in the single time-scale scenario of Fig.[2](b), the underlying agent 
network remains sparse and an arbitrary communication network may not suffice to make the networked 
estimation error stable (e.g., see (24), (26), (27)). 

In this context, the key problem is to design the structure of the inter-agent communication according 
to the underling fusion rules in order to recover the observability of the networked estimator. In this 
paper, we use a variant of the Networked Kalman-type Estimator (NKE) protocol, initially introduced 
in (24). The main contribution is to determine the communication network among the agents to recover 
the observability of the underlying estimation protocol, given that each agent may not be locally (in 
its neighborhood) observable. We study the observability with a structural point of view (28)-(34) in 
the sense that we explore the generic properties of the system. The generic properties are applicable to 
any choice of system parameters as long as the sparsity structure (zeros and non-zeros) is not violated. 
The generic approach is helpful when the underlying system parameters may change depending on the 
system operating point (linearization of non-linear dynamics) and is further significant in communication 
network design as the approach is independent of the exact value of the weights chosen for data fusion. 
Moreover, this implies that for smooth non-linear systems with fixed structure Jacobian matrix, similar 
analysis of the networked observability can be applied. 

Comparing with other work in the literature, we consider single time-scale estimation, as opposed to 



the multi time-scale estimation in (19)-[[22) and the vanishing time-step algorithms proposed in [ [35] , (36). 
Unlike (37) , (38) , we do not impose an agent hierarchy (i.e., we assume the processing/communication 
duties at all agents are the same). Avoiding agent hierarchy increases the reliability of node/link failure. 
We further do not require the communication network to be (strongly) connected fLO) , (TT), (T6) , fT7) , 
(20) , [26] or for it to include a cyclic path (12), (18). Our goal is to design the network with minimal 
communication. Specifically, we use methodologies that are independent of exact system values and rely 
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only on the structure of the underlying system. This leads to a robust estimator design where the analysis 
is not algebraic, as in the conventional Grammian or PBH observability tests, but graph-theoretic |30|, 



[31]. 



We now describe the rest of the paper. Section [TT] provides preliminary material on basic dynamical 
system estimation and structured systems theory, whereas Section [III] presents our problem formulation. 
Section [IV] enlists our assumptions and describes a novel agent classification method. Section [V] covers the 



main results of this paper on state and output fusion, whereas local gain design is explored in Section VI 



We provide an illustrative example and simulations in Section |VII[ and finally, Section |VIII| concludes 
the paper. 



II. Background and Preliminaries 
We consider the system model to be a discrete-time linear dynamical system: 



(1) 



where x& E W 1 is the state vector, A = {a^} E R nXn is the system matrix, and v& ~ Af(0,V) is 
the system noise. We note that the main emphasis of this paper is not on modeling but on structural 
characteristics of the underlying system and the results we present hold for any phenomenon following 
Eq. ([T]). We assume that the dynamical system is monitored by a network of N agents such that each 
sensor i has the following observation model: 



y\ = Ci* k + r 



(2) 



where y\ E W l is the output vector at agent z, y\ ~ AA(0, Ri) is the output noise, and C{ is the output 
matrix at agent i. With this notation, we can write the global observation model as 



(3) 



where 









' Ci " 




r 1 
r k 


Yk 




, c = 


Cn 




r k 



(4) 



r k ~ A/"(0, R) is the global observation noise with R = blockdiag[i?i, . . . , Rn], and C = {cij} is the 
global output matrix. 
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A. Centralized estimator 



Let be the centralized Kalman estimator [39| at time k given all the observations, y&, up to time k. 
It can be shown that the error, e^ k = x& — x^ fc , in this estimator is given by 

*l\k = ( A ~ KcCAffi^^ + (5) 

where if c is the centralized Kalman gain and the vector 77^ collects the remaining terms that are 
independent of e^_ 1 i fc _ 1 . It is well known that the centralized Kalman error, e^ fc is stable if and only 
if all the unstable modes (eigenvalues) of the system are observable. For the ease of explanation, we 
assume that there are no stable unobservable nodes. In other words, detectability and observability are 
equivalent throughout this paper. 

In the traditional sense of n-step (A, C) -observability, the observability Gramian is given by 

O = [C T A T C T ... (^" 1 ) T C T ] . (6) 

Algebraic tests for observability check the Gramian, O, for being full-rank or the matrix O t O for being 
invertible. An alternative method is the PBH (Popov-Belevitch-Hautus) observability test pO) , which 
requires the matrix, [A T — si C T ], to be full-rank for all s. The matrix [A T — si] is full rank for all 
values of s other than the eigenvalues of A and, therefore, the PBH test is needed to be checked only 
for these values. 

Note that, both these algebraic methods rely on the knowledge of exact values of each element in the 
matrices A and C. However, in many dynamical systems, only the sparsity (zero and non-zero pattern) of 
these matrices may remain fixed while the non-zero elements are subject to change. For example, when 
the elements of the concerned matrices depend on certain parameters or operating points. Hence, these 
conventional methodologies fail to check for observability in such cases and graph-theoretic techniques 
are to be employed. We introduce such graph-based methods below. 



B. Graph notations 

Let X = denote the state set, and let Y = {yi, . . . , y p } denote the output set. We 

define the system digraph as Ga = (V,E), where V = X U Y is the vertex set, and E is the edge 
set containing directed edges, (^1,^2) £ E, of the form v\ —¥ V2 with v\,v<i E V. The edge set E 
is defined as Ea U Eq, where Ea = {(xj,Xi) \ aij ^ 0} and Ec = {{xj,yi) \ ^ 0}. A path of 
length £ from v\ G V to V£ G V is such that there exists a sequence of vertices, v\, 1*2 , . . . , V£ with each 
subsequent edge, (v\, ^2), (^2, ^3), • • • , (^-1, vi) E E. Here v\ is the begin-vertex of the path and v# is 
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6 < 




Fig. 3. This digraph is an example of (A, C) observable system based on the Theorem [T] 



its end-vertex. Here, we assume that each vertex contained in a path occurs only once {simple path). A 
path is said to be Y -topped if it ends at a vertex in Y. A digraph is called strongly connected if there 
exist a directed path from each vertex to every other vertex in the digraph. In a not strongly connected 
digraph, define Strongly Connected Components (SCC) as its maximal strongly connected partitions or 
sub-graphs. A cycle is a simple path where the begin and end vertices are the same. Since the nodes in Y 
have no outgoing link, nodes included in a cycle all belong to X. As an example, consider Fig. [3] which 
shows the system digraph of a dynamical system with n = 7 states (encircled) and N = 3 measurements 
(or agents) denoted by squares. We now introduce some new concepts on SCCs over state vertices. These 
will be helpful in describing our results. 

Definition 1 (Parent SCC): A state SCC, is called a parent SCC, if it has no outgoing link to any state 
vertex not belonging to itself. 

Definition 2 (Child SCC): Any SCC that is not a parent SCC is a child SCC. 
Notice that the set of disjoint state SCCs in system matrix A can be explicitly characterized as either a 
parent or a child. As an example, the SCC containing vertices {4, 5, 6} in Fig. [5] is a parent SCC, since 
there is no outgoing edges from its states to other states {1, 2, 3, 7} not included in it. Furthermore, {1,2} 
and {7} are child SCCs. More details on parent/child SCCs and efficient algorithms for computing SCCs 
in a digraph can be found in [27] and (41), respectively. We now use the concepts from this section to 
formally introduce structured systems theory and generic properties. 
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C. Structured systems theory 

Structural analysis deals with system properties that do not depend on the numerical values of the 



parameters but only on the underlying structure (zeros and non-zeros) of the system |28|-(34). It turns 
out that if a structural property is true for one admissible choice of non-zero elements as free parameters 
it is true for almost all choices of non-zero elements and, therefore, is called a generic property of the 



system (42) . Furthermore, it can be shown that those particular (non- admissible) choices for which the 
generic property does not hold lie on some algebraic variety with zero Lebesgue measure, for more 
details see (42), (43). 

Definition 3 (S-rank): The structural rank (also called generic rank) is the maximum rank for all 
numerical values of the non-zero entries of the matrix A. It is, in fact, an upper-bound on the numerical 
rank of A. 

The S-rank as a generic property holds for almost all choices of nonzero parameters of the matrix, A. 
It is equal to the cardinality of the maximum matching associated to the bipartite graph associated to 
the matrix, A. In the algebraic sense, this is the maximum number of non-zero elements in distinct 
rows and columns of the matrix, A (44) . Details on the generic rank implication in graph theoretic 
sense and algorithms on maximum matching can be found in (30) , (41). Among other generic properties, 



controllability/observability are of interest in the context of this paper, see (29) , (30) , (33) , (34) , (45) , 
1 46 1 for details. We extend the following theorem from the generic controllability results in (3TJ. 
Theorem 1: A dynamical system is generically observable if and only if in the system digraph: 

(i) Every state is the begin-node of a path that ends in an output (termed as a Y -topped path); 

(ii) There exist a disjoint union of F-topped paths and cycles that cover all the state vertices. 
The following lemma is from (28) . 

Lemma 1: The condition (ii) in Theorem [T] on the generic observability of (A nxni C mXn ) is equivalent 
to the following: 

' r A 

C 

The proof of Theorem [T] for generic controllability and Lemma [T] is given in (28) , (3T) , where other 
equivalent graph-theoretic conditions to generic controllability (observability) are also defined that we 
omit here. As an example, consider the system shown in Fig. [3] It can be verified that each state is a 
begin-vertex of a F-topped path, and {7}, {4, 5, 6}, {1, 2, 3, a} constitute a disjoint union of cycles and 
y-topped paths that cover all the state vertices in X. Thus, satisfying both conditions in Theorem [T] and 
the system in Fig. [3] is observable for almost all choices of non-zero elements. 
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D. Corresponding graphs 

In this paper, we deal with two different graph representations: system digraph, Ga, representing states 
of the dynamic system ([!]) and ([2]), and digraph Gw defining the agentcommunication network. Let Gw = 
(Vw, Ew), where Vw = {1, . . . , N} is the vertex set consisting of N agents, E\y = {(i, j) \ i ^— j} is 
the edge set, and T>i = {i} U {j | (i, j) E E} denote the extended neighborhood of agent i. Notice that, 
unlike many works in the literature we do not constrain Gw to be undirected. In fact, no assumption on 
the topology is considered here, as designing Gw is a contribution of this paper. 

Example: To shed more light on this, we give an example here. Consider the flocking motion example 
given in the Fig. [T] The position, velocity or acceleration of every quad-copters can be considered 
as a state of the system, having ground robots as agents, the coordination law among them, typically 
following dynamics ([]} |47| , defines the dynamical system, A, and system digraph, Ga- The system 
outputs/measurements are the states tracked by the ground robots, and the communication network (to 
be designed) among these ground-robots is Gw> 

III. Problem formulation 



We employ a variant of the Networked Kalman-type Estimator (NKE) proposed in |26|, |27|. Let 
be the state estimate of agent i at time k given the outputs up to time m, (m < k), from its neighboring 
agents, j E T>i. Each agent implements the following: 

(i) Predictor and state fusion: 

*l\k-i = J2 Wi i A *l-i\k-i ( g ) 

(ii) Estimator and output fusion: 

*\\k = *k\k-i + K l c J(yi ~ c j yL k\k-i) ( 9 ) 

jeDi 

where W = {wij} is the state fusion weight matrix such that Wij > with J2jev w ij = 1 (W is 
stochastic), and K l k is the local estimator gain at agent i. 

Remark 1: Following are some useful remarks: (i) The diagonal entries of W are all nonzero, since 
every agent is in its own extended neighborhood and uses its own information, (ii) The first equation ([8]) 
is a local prediction fusion where each agent i fuses the neighboring estimates from time k — 1 and then 
implements a local predictor, (iii) In the second equation ([9]), each agent i updates its local prediction 
with an innovation term. We define this innovation as the difference between the state prediction of 
agent i and the state measurements obtained via agents, j E T>i. Adding this term, agent, z, makes its 
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final estimate, xjL fe , for the current time step, (iv) The protocol given in Eqs. ([8])-(|9]) takes place at the 
same time-scale as the system dynamics, see Fig. [2] Notice that both Eqs. ([8])-(|9]) can be combined into 
one equation; we give separate equations for the ease of explanation. 
Let the estimation error at agent i and time k be defined as 



*k\k 



x 



k\& 



(10) 



and let e k = [(e^ T 



- kJ , . . . , (e k ) ] be the networked estimation error derived in the following. 
Proposition 1: Let be some function of the system and measurement noise, v k and r z k , independent 
of ek-i and let 



K k = blockdiag^,...,^] 
D c = 







cjc 3 



^N\T-]T 



q fc = [(qi)V . . , (q£ 
Then we get the following networked error dynamics, 

e k = (W ® A - K k D c (W ® A))e k -x + q k . 



(11) 



(12) 



The derivation requires some straightforward manipulations and is omitted here. Comparing this to Eq. ([5]), 
we note that the networked estimation error, e k , can be stabilized if and only if, the following pair, 



{W®A,D C )> 



(13) 



is observable. In other words, a gain matrix, K k , exists such that p(W ® A — K k Dc(W ® A)) < 1 (i.e., 
it is a Schur matrix), if and only if (W ® A, Dc) is observable, where p(-) denotes the spectral radius 
of a matrix. As it can be seen from Eq. ( [T3] ), the communication network, W, plays a major role in 
distributed estimation as opposed to the multiple time-scale approach where W is irrelevant. The role of 
W in observability is because of the single time-scale nature of the estimator, see Fig. [2j 
Remark 2: 

• The variables Dc and K k are block-diagonal matrices. 

• Every block diagonal, J2jev z ^JCj, in the matrix Dc, can be thought of as a representation of all 
of the measurements in the extended neighborhood of agent i. 
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Fig. 4. (Left) Matrix structure of the distributed system with no data fusion. Every block diagonal Wu A is a subsystem 
associated with the output/agent i. (Right) Adding data fusion, the intra-connections among these subsystems depends on the 
non-diagonals Wij A, i / j. 



We refer to (W®A, Dc) as the distributed system and Gw^A as the graph associated with the matrix 

A. For better understanding of the structural relevance of the estimator in ([8])-([9]), we first consider W = I 

and Dc defined as follows, 



(14) 



implying no information exchange among the agents. This distributed system, {I® A, Dc), can be thought 
of as N subsystems each of them associated to an n x n block diagonal in the matrix W ® A, see Figj4] 
(Left). Now consider W to have some non-zero non-diagonal entries. As it is shown in Figj4] (Right), 
these entries define the inter-connections among these subsystems. 

To shed more light on this, consider the example given in Fig. [3] where we show a n = 7-state 
dynamical system with N = 3 agents, {a, 6, c} such that agent a measures xs, agent b measures x§ 
and agent c measures xj. Each agent is required to estimate the entire n = 7 dimensional state- vector. 
Without any information fusion each agent only has a partial observation of the system as it is shown in 
Fig. |5j Therefore, each agent has to acquire the missing information via communicating with agents in its 
immediate neighborhood. However, in this illustration, no agent finds any observation in its neighborhood 
in addition to what it already possesses. Information sharing among the agents by applying state and 
output fusion provides more links among the subsystems in the distributed system digraph. This extra 
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Fig. 5. The graph associated with distributed system, (J A, Dc), with no data fusion represented as a sub-system at each 
agent. According to the Theorem [T] each sub-system (agent) is not observable with no data fusion. 



linking among subsystems and outputs, captured by the non-zeros in W and the summation in Dc, has the 
potential to improve the generic observability of the system. In this regard, the main objective is to define 
the structure of the communication matrix W (graph Gw) suc h that the distributed system (W ®A, Dc) 
is generically observable. 

We seek a general method to make each subsystem observable. First, we describe how adding a link 
between two agents changes the graph structure of the distributed system. We explain this by considering 
the same example as in Fig. [3] and Fig. [5] In the case of output-fusion, a link between two agents, for 
example, from agent b to agent a (a ^— 6), implies that agent a has access to agent 6's measurement, 
that is measurement of state x$. However, the state-fusion case is more involved. For example, adding a 
link from agent b to agent a implies a nonzero entry in W, i.e., w a ^o, which reflects in the networked 
system matrix, W ®A, as adding edges to some states in the subsystem associated to agent a from some 
states in the subsystem associated to agent b. This will be discussed in more detail in the next sections. 

IV. Our approach 

We now enlist our assumptions and provide a novel agent classification that will help in establishing 
our results. 



A. Assumptions 

In the rest of the paper, we make the following assumptions: 

(i) The communication between the agents is stable, i.e., the the network is static; 

(ii) For every agent, i, the pairs, (A, Q) or (A, J2jev CjCj), are not necessarily observable; 
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(iii) The system is globally (A, C) -observable, i.e., if we collect all the sensor measurements at a center 
then the dynamical system is observable. 
Assumption (ii), in practice, makes the networked estimation problem more challenging and is where 
this work becomes significantly different from current approaches, see, for example, (7) and references 
therein. Assumption (iii) is a typical assumption in distributed estimation implying the observability of 
centralized estimator; without this, no estimation scheme will work. 

B. State and Agent classification 

To describe our approach, we provide a novel agent classification. Since the system is (A, C) -observable, 
(iii) in Section |IV-A[ we can enlist a disjoint set of cycles and Y -topped paths that covers all the state 
vertices (existence is ensured from condition (i)-Theorem[T]). We are interested in a listing that involves 
the maximal cycles and we denote this set as C. For example, from Fig. |3j the disjoint set of cycles 
and y-topped paths that covers all the state vertices includes {(4, 6, 4), (5, 6), (1, 2, 3, a), (7, c)}, and 
{(4, 5, 6, 4), (1, 2, 1), (7, 7), (3, a)}, among others. However, the latter includes the maximal cycles and 
thus C = {(4, 5, 6, 4), (1, 2, 1), (7, 7), (3, a)}. The following classification is with respect to C. 

(i) Type-a agent is an agent that appears in the F-topped paths in C. For example, agent a in Fig. [3] 

(ii) Type- (3 agent is an agent that measures a state in the parent cycles cycles in £; a parent cycle is a 
cycle that does not have an outgoing link to any other state not belonging to itself. For example, 
agent b in Fig. [3j 

(iii) Type-*/ agent is an agent that measure a state in the child cycles in C\ a child cycle is a cycle that 
is not a parent cycle. For example, agent c in Fig. [3j 

The above agent classification leads to the following definition and lemma. 

Definition 4 (Crucial observation): A crucial observation is a measurement such that removing it 
renders the dynamical system unobservable. 

Lemma 2: The agents of Type-a: and Type-/3 make "crucial" observations while the measurements at 
Type-7 agents are not crucial. 

Proof: Since the Type-/3 agents monitor the parent cycles and there is no outgoing link from a 
parent cycle to any other state outside this cycle, the states in the parent cycles can only be the begin 
vertices of a F-topped path (in order to satisfy condition (ii) in Theorem [T]) when any one of these 
states is connected to an output. Hence all the Type-/3 agents make crucial observations. On the other 
hand, removing a Type-a agent violates the condition (i) in Theorem [T] as the attached state vertex is not 
included in C anymore. Hence, Type-a agents are also crucial. Finally, the only location for the Type-7 
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TABLE I 

This table shows distributed system for different fusion levels. 

Fusion level equivalent distributed system 
No data fusion (I <g) A, 5c) 

Only state fusion (W <g) A, ~Dc) 

Only output fusion (I ® A, Dc) 

Both measurement and state fusion (W A, Dc) 



agent is monitoring a child SCC, which either has a directed path to a Type-a agent or to a Type-/3 agent 
and hence is redundant. While the contribution of this state remains in C due to the cycle present there. 

■ 

For example, in Fig. [3] if either agent a or agent b is removed, then the system becomes unobservable. 
It can also be verified that agent c is non-crucial. Having defined types of agents, we note that the 
observability of the distributed system can be recovered via either W ® A matrix (state-fusion) or Dc 
(output- fusion). Here, we provide the minimal sufficient number of communication among the agents. 
Unlike our previous works (26), (27), we do not impose any constraint on the system matrix, A. 
Furthermore, the generic approach is further robust to uncertain systems and to linearized approximation 
of nonlinear models where the structure is fixed while the values are a function of the operating point 
(29). 

V. Recovering observability 

In this section, we first present some helpful results for the development of the paper and then find 
a general solution for (W ® A, Dc) observability. We first discuss the role of state fusion, related to 
the structure of matrix W, and then the role of output fusion, related to the structure of matrix Dc (see 
Table [I}. The reason is to get more intuitive and separate solutions for state and output fusion; obviously, 
in real applications if two agents are linked together they nay share all of their information, including both 
their measurement and state estimates, to maximally improve their current state estimates. The results 
and proofs in this section are mainly graph theoretic that is a direct consequence of using the generic 
approach. 

A. Results on rank genericity 

The result below follows from Theorem [T] and Lemma [T] as provided in Section [TTj 
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Corollary 1 (Full S-rank): A system matrix, A, is full S'-rank if and only if its associated digraph has 
a disjoint union of cycles covering all the state vertices. 

Notice that non-zero diagonals of a matrix can be represented as a disjoint union of self-cycles in its 
associated digraph. From Corollary [T] and by Remark [T] (Section [Hi]) we obtain the following result. 
Corollary 2: The communication matrix, W, has a disjoint union of self-cycles and, 

S-mnk(W) = N. (15) 

This is always true because W has non-zero diagonals, i.e., wu ^ 0, Vi. Consequently, we state the 
following lemma for the networked system (W ® A). 

Lemma 3: For the communication matrix, WnxN, an d system matrix, A nxn , the networked sys- 
tem W ® A is structurally full-rank if and only if A is structurally full-rank. Mathematically, 

S-rank(W ®A) = N xn <^ S-rank(A) = n (16) 

Proof: Recall that for two matrices, W and A, 

rankiW <g> A) = rankiW) x rank(A) (17) 

From Corollary [2j we have rankiW) = N for almost all numerical values, and for any full rank 
matrix A nxn , we have 

mdix(rank(W <g> A)) = N x n, (18) 

Based on the definition of the S'-rank, we can conclude that (W ® A) is generically full rank for almost 
all choices of numerical values. This proves the necessity. On the other hand, if rank (A) < n for any 
choice of W, then we have 

mdix(rank(W <g> A)) < N x n, (19) 

which implies that, 

S-mnk(W ® A) < N x n. (20) 
This proves the sufficiency. ■ 

B. State fusion 

We now explore Eq. ([8]) in NKE protocol and assume that there is no output fusion. In particular, we 
analyze the structure of the matrix W for (W <8> A, Dc) observability according to Table |l| First, we 
provide some special cases where the system matrix, A, is structurally full rank. This is the case, for 
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example, in linearization and discretization of non-linear systems where the system matrix almost always 
has non-zero diagonal entries. 

Lemma 4: For full S-rank system matrix, (A, C) is centrally observable if and only if every parent SCC 
is output connected, i.e., monitored by (at least) one agent. 



The proof is straightforward and omitted here. Interested readers may see our previous work in [27]. The 
following theorem establishes conditions on the communication network, GV, over full 5-rank systems. 



Theorem 2: With a full S-rank system, A, the pair (W ® A, Dq) is generically observable when for 
every parent-SCC in A, say /C, if agent i does not have an observation of a state in /C, then in the 
communication network, GV, there must be a directed path from agent i to any agent j, which has a 
state observation ir0 K. 

Proof: The system matrix A being full 5-rank ensures the condition (i) in Theorem [T] This is because 
from Corollary [T] there exists a disjoint union of cycles alone that cover all of the state vertices and 
the y-topped paths are not needed to verify condition (i). To satisfy condition (ii), all state vertices in a 
subsystem associated to every agent, say z, must be a begin vertex of a F-topped path. This condition, 
according to Lemma |4j is satisfied by having every parent-SCC in W ® A be output-connected. 

Since in communication matrix W there is a path from agent i to j, in Gw®A graph, subsystem of agent 
i is connected to subsystem of agent j. Therefore, every state vertex in parent-SCC /C in subsystem i is 
connected to parent-SCC /C in subsystem j (see Fig. [6]). Since every state of parent-SCC /C in subsystem 
j is y-topped path to output j, every state of parent-SCC /C in subsystem i is also connected to output j. 
With this for every parent-SCC /C in every subsystem i of Gw®A, all parent-SCCs are output connected 
and the theorem follows. ■ 

For example, consider again the three-output system in Fig. [5| Having vertices {4, 5, 6} as parent-SCC, 
agent b is the Type-o: agent. According to the above theorem any other agent without any observation 
in {4,5,6}, like agent a, must be connected to agent b. This provides a connection from parent- 
SCC {4, 5, 6} in subsystem a to its counterpart SCC in subsystem b in distributed system graph Gw®A, 
and in turn its output connectivity. 

A very important point to mention here is that for full S-rank systems, there only exist Type-/3 and 
Type-7 agents. We prove this in the following lemma. 

2 If there is more than one agent observing SCC AC, say agents j, k, a directed path from agent i to only one of them is 
sufficient. 
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Fig. 6. This figure illustrates the proof of the Theorem [2] showing that: A directed path from agent i to agent j in W matrix 
(left) implies a directed path from states in subsystem i to subsystem j and consequently agent j in (W A) matrix. 



Lemma 5: If a system matrix has full S-rank then we only have Type-/3 or Type-7 agents. 

Proof: The proof is straightforward and relies on Corollary [T] Since A is full S-rank, there exists a set 
of disjoint cycles that covers all of the state vertices. Hence, the set C introduced for agent classification 



in Section |IV-B| does not include any F-topped paths and thus we cannot have any Type-a agent. ■ 
However, when the system matrix is not full 5-rank, we also encounter Type-a agents that possess crucial 
observations from Lemma [2] It turns out that if the system matrix is not full S-rank, then even using a 
fully-connected communication network (complete Gw graph) does not recover observability. We now 
provide our main result on state fusion. 

Theorem 3: Assume that (A, Cj) is not observable at any agent i. If system, A, is not full S-rank, 
then the NKE ([8])-([9]) is not observable with state fusion alone, i.e., (W ® A, Dc) is not observable for 
any choice of the communication matrix W. 

Proof: Let i be an agent for which condition (i) in Theorem [T] does not hold, i.e., 

A 



S-rank 



cm 



< n. 



(21) 



Such an agent always exists because: (i) based on the assumption (ii) in Section [III} the entire system is 
not observable at any agent; and (ii) the matrix A is not full S-rank. Now consider (W ® A, Dc) for 
the best-case scenario where Gw is an all- to-all network. Let Wi be the ith column of the adjacency 
matrix W. Obviously, Wi (g) A is the zth block column of (W ® A), and contains block matrices WjiA 
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W Ni [A] 
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W 1N [A] 
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Fig. 7. The figures illustrates the structure of W% A (left) and matrix W A (right) in the proof of Theorem [3] 



where Wji ^ is the element in jth row and zth column of the full matrix W. It follows that 



S-rank 



Wji A 
Cffd 



< n, 



(22) 



for all j = 1, N as wji ^ and scalar multiplication does not change the structure and the S-rank 
(maximum possible rank over all values). Since A is not full S-rank, W% ® A has rank less than n 
as stacking matrices with the same structure on top of each other (see FigJ7]-Left) does not improve 
the Shrank, which immediately results in 

7 f W r ® A 



S-rank 



cm 



< n. 



(23) 



S-rank 



< Nn. 



(24) 



Consequently, according to FigJTJ-right, the structure of the matrix W ® A is given as the side-by-side 
concatenation of the matrices W% ® A. Thus we have, 

W® A 

This holds for almost all choices of non-zero elements in the W matrices. Clearly, for any lower Sh- 
rank W the rank of W ®A cannot be recovered as well. Therefore, according to Lemma [T] the condition 
(i) in Theorem [T] is violated. ■ 
The above theorem shows that when A is S-rank deficient, then using state fusion cannot always 
guarantee the observability of the system and thus, the agents need access to more measurement data to 
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recover observability, which is discussed next. In contrast, Theorem [2] shows that when A is full S-rank, 
then state fusion alone can recover observability and further provides a method for the required agent 
communication. Clearly, these two results are to be viewed as a direct consequence of Assumptions (ii) 



and (iii) in Section IV-A| To the best of our knowledge, these conditions have not been developed before. 



C. Output fusion 

The other solution to make the NKE observable is output fusion, that is the second update level given 
in the equation ([9]). According to the formulation, each agent shares its measurement with its direct 
neighbors and implements this as an innovation to update its prediction. According to Table |l| for output 
fusion the structure of the matrix Dq has to be determined such that I ® A, Dc is observable. 

Based on the definition of Dc, the ith n x n diagonal block of Dc contains all of the measurements 
in the extended neighborhood of agent i. In the distributed system graph Gf I( $^ Dc \, say for the agent z, 
this is equivalent to adding all measurements in the neighborhood Mi to the subsystem i. 

We now provide our main result on output fusion. 

Theorem 4: The system (/ ® A, Dc) is observable if and only if: 

(i) The sub-graph of all Type-a and Type-/3 agents is a complete graph, i.e. all these crucial agents 
are needed to be directly linked together; 

(ii) Every Type-7 agent is directly connected to all Type-a and Type-/3 agents. 
Proof: 

Sufficiency: With the given conditions (i) and (ii), each agent has access to all crucial measurements. 
This makes every agent generically observable. 

Necessity: If an agent is not connected to one of the crucial agents, then it is missing a crucial 
measurement and the statement follows. ■ 
It can be verified that if a system is not (A, C) observable then even using a fully-connected communi- 
cation network does not recover observability. Clearly, the only way to get a stable estimation error is 
by increasing the number of state observations and recovering the (A, C) observability. An interesting 
result on how to recover (A, C) observability can be found in |32). 



D. Main result 

Finally, we consolidate our results in previous subsections on state and output fusion. Theorem [2] 
sets the condition for state fusion for full Shrank systems, i.e., conditions for (W ® A, Dc) generic 
observability, while Theorem [3] states that for general 5-rank deficient systems networked observability 
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cannot be achieved via the state fusion alone. Output fusion, i.e., generic observability of (/ ® A, Dq), 
is discussed in Theorem [4] Combining these results, we now provide the main theorem on generic 
observability of the single time-scale NKE protocol in Eqs. ([S])-©. 

Theorem 5: For (W ® A, Dq) observability with minimal number of communications, each agent 
needs: 

1) A direct link from all the Type-a agents (output-fusion); 

2) A directed path to (at least) one Type-/3 agent for every parent SCC of A. This means, if there is 
two or more agents observing the same parent SCC, a directed path to any one of them is sufficient 
(state-fusion). 

Proof: The proof is a direct consequence of Theorems [2j [3j and [4] ■ 

The following is a complementary remark to the Theorem [5] 

Remark 3: In the case of Type-/3 agents, every agent requires either a directed path to each Type-/3 
agent (as stated in the Theorem [5} or a direct link from each Type-/3 agent (as stated in the Theorem]?]); 
either one of these two conditions is sufficient for observability. However, the first strategy requires less 
number of links compared to the latter one, and therefore, it is preferred in terms of the minimal number 
of links in the communication network. 

Notice that compared to the typical assumptions on the agents' network in the literature, like strong 
connectivity or having a cyclic path, here we provide milder condition on the non-crucial agents; as 
there is no need for connectivity to these agents but from these agents. Furthermore, an agent may have 
no measurement of the system and still be able to estimate the state of the system via the proposed 
strategies. Such agents, for example, may play a role to provide and maintain the connectivity of the 



agent communication network |48|, or even, maintaining directed paths to Type-/3 agents as stated in the 



second condition of the Theorem |5j 

VI. Design of local estimator gain 

In this subsection, we consider the design of the estimator gain matrix, K^. Notice that having (W ® 
A,Dc) observable guarantees a full gain matrix, K^, to stabilize the NKE error. However, according to 
protocol ([9]), we need a local gain matrix, Kk, which is block-diagonal with N blocks of n x n matrices. 
For this section, we assume a constant estimator gain matrix is applied, i.e., the matrix is independent 
of time, k, and denote it by K. 

A partial list of references devoted to find constrained estimator gain for control and estimation is [j26| , 
|49|-[52]. Here, we use the Linear Matrix Inequality (LMI) approach in (50), [ 5TJ . However, in general, 
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the corresponding LMIs do not have a solution, because of the structural constraints (block-diagonal) on 
the gain matrix, K. This is the main difficulty in distributed estimation and control as convex/semidefinite 
approaches are not directly applicable. To this end, we implement an iterative procedure to solve LMIs 
under structural constraints. 

In this regard, the following lemma presents the optimization approach to solve the estimator gain 
design problem. Interested readers may find more details in (26|, pO) . 

Lemma 6: If the NKE protocol ([8])-(|9]) is observable, then estimator gain matrix, K, is the solution 
of the following optimization problem. 

min trace(XY) 

subject to X, Y > 0, 

X A T 1 f X T 1 (25) 



>0, 

A Y 
K is block-diagonal. 



X I 
I Y 



>0, 



where, 



A = W ® A - KD C (W ® A) 



(26) 



In fact, we need a block-diagonal K such that A is Schur (i.e. p(A) < 1). 

Notice that, the solution to the second LMI is equivalent to X = Y _1 , which gives the minimum 
trace and the optimal value as nN. The nonlinear product of X and Y can be replaced with a linear 



approximation | |50| , pT| , [ [53) , (j>u n {X, Y) = trace (lo-X" + XqS) and an iterative algorithm [51 1 can be 
used to minimize trace (XY) under the given constraints. 



Algorithm 1 Iterative calculation of local gain estimator, K. 



0: Find feasible points X0, Y0, K. If no such points exist, Terminate. 



0: At iteration t > minimize trace(Y^ + X t Y) under the constraints given in ( [25] ) and find X, Y, K. 
0: If p(A) < 1 terminate, otherwise set Y t +i = Y, X t +\ = X and run the step 2 for next iteration 
t = t + 1. 



It is shown in pT| that trace(Y^X + XtY) is a non-increasing sequence that converges to 2nN. In this 
regard, a stopping criterion in step 3 of the above algorithm can also be established in terms of reaching 
within 2nN + e of the trace objective. The iterative procedure given above is centralized, however, the 
center has to implement this process only once, off-line; then it transmits the estimator gains to each 
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agent and plays no further role in the implementation of local estimators; each agent, subsequently, 
observes and performs in-network operations to implement the estimator. A single time-scale algorithm 
can also be employed, where the above iterative procedure is implemented at the same time- scale k as 
of the dynamical system. With this approach, the estimator gain iterations, K t , at each t is applied to 
the estimator at time-step, k, and may be transmitted to each agent at each step k. This is helpful when 
the implementation is assumed in real-time. 

VII. Illustrative example and simulation 
Consider the system, (A, C), given in Fig. [3] The structure of these matrices is given by 
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(27) 
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Now, recall that based on Theorem [TJ the system is globally observable by collection of the three 
measurements. The state partitioning with maximal cycles so that this partitioning covers all the states 
(in light of condition (i)-Theorem [TJ is C = {(4, 5, 6, 4), (1, 2, 1), (7, 7), (3, a)}. By definition, agent a 
is Type-a, agent b is Type-/3 and agent c is Type-7. It can further be verified that agents a and b possess 
crucial observations. 

To better illustrate the networking effect, we first note that the networked system with no information 
sharing, i.e., the graph associated to (I® A, D c ), is not observable at any of the agents individually (Fig. [5]). 
To make the networked system observable at each agent, we propose the following communication 
matrices W\ and W2, and their associated graphs Gw in Fig- [§] as two minimal networks making the 
system observable. 
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Fig. 8. (Left) communication network for the three agents using output fusion (Theorem [4|, (Right) both measurement and 
state fusion (Theorem [5}. 
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The graph associated to W\ is proposed based on the Theorem [4] In this network crucial agents, {a, 6}, 
are directly linked among each other and both have a directed link to agent, c, with no crucial observation. 
The second communication network W2 is based on Theorem [5} there is a direct link from agent a (Type- 
a) to all other agents, and there is a path from every other agent to agent b (Type-/3). It can be verified 
that for both topologies (W ® A, Dc) is generically observable. Note that the solution for the network 
design problem is not unique, and there maybe other examples of communication network satisfying 
the conditions in the last section. In addition, any network including one of these two topologies as a 
sub-graph is also a solution to the NKE problem. For example, under the full structured rank assumption 
of system A, any strongly connected network among agents suffices for individual observability (26). 

For simulation, we consider a random valued matrix, A, with the structure in Eq. ( [27] ), and an output 
matrix, Eq. ( [28] ), with all non-zero entries equal to 1. The system eigenvalues are as follows. 

eig(A) = [-1.0838, 1.0838, 0.6511, -0.5571, . . . 

-0.0940,0.0000,1.3072] 

Clearly, the dynamical system is unstable, since p(A) = 1.0838. We choose the agents' network according 
to Fig. [8] (Right) with random link weights such that it remains stochastic. We use Algorithm [T] to find 
the block-diagonal gain matrix, K. Using this gain matrix, the eigen- values of the error dynamics, i.e., 
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Fig. 9. Performance of the networked estimator at each agent. The error is squared and then summed over n = 7 states at 
each iteration. 



of the matrix, A, are as follows, 

eig(A) = [0.8190, 0.6511, -0.5571, 0.0073 ± 0.3159z, . . . 

-0.2700, 0.1643, -0.1406, -0.0940, 

-0.0788, 0.0214, -0.0237, 0, 0, 0, 0, 0, 0, 0, 0, 0], 

which are all stable. The system and output noise are, respectively, v& ~ AA(0, 0.05) and y\ ~ A/"(0, 0.2). 
As system initial state we choose a random initial value between and 3. The system error evolution 
over 100 iterations for three agents are given in Fig. [9] For the visual clarity, we have squared the errors 
at each iteration and then summed them over the n = 7 states of the dynamical system. As it can be 
seen, the estimation error at all agents is bounded despite the fact that system is not stable. 

VIII. Conclusion 

In this paper, we study the role of the agent communication network towards error stability of the 
NKE protocol ([8])-([9]) in the context of single time-scale distributed estimation. As opposed to multiple 
time-scale strategies where the communication network is irrelevant and diffusion strategies where the 
estimator error is irrespective of system dynamics, here, we take into account both system dynamics and 
communication network. We show that the NKE is able to track even potentially unstable dynamical 
systems, i.e., the networked estimator is observable for all stable and unstable eigenvalues. We show 
that under sufficient communication among the agents, the system state is generically observable at 
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every agent. Here, we provide minimal sufficient network connectivity applicable for multi-agent systems 
subject to constrained communication, e.g., out-of-range geographical distances or costly communication. 
We define three types of agents/measurement where two types are crucial for observability. We provide 
two main results on recovering networked observability: (i) with state-fusion, and (ii) with output-fusion. 
Furthermore, we determine dynamical systems (S-rank deficient) for which no state-fusion results in an 
observable networked estimator and one has to rely on output-fusion as well. 

Our results are on the existence of a network structure for bounded estimation error and further finding 
such network with minimal links. Because of the genericity, the link weights are free parameters and 
results are independent of any particular fusion rule (e.g., Metropolis-Hastings |54|) chosen in ([8])- 
([9]). Nevertheless, the structure of the underlying agent communication remains relevant and leads to 
network/infrastructure design. Furthermore, link weights can be optimized to reduce the error, which is 
a direction for the future work. It is worth noting that, in general, S-rank and other generic properties 



are easily verified. For example, there are efficient graph theoretic, |28|, flow theoretic, (55), and linear 



programming, |56|, methods that can be employed to check for generic properties. 
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